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Abstract 

Background: Genome wide association studies (GWAS) have identified several SNPs associated with colorectal cancer (CRC) 
susceptibility. Vitamin D is also inversely associated with CRC risk. 

Methods: We examined main and joint effects of previously GWAS identified genetic markers of CRC and plasma 25- 
hydroxyvitamin D (25(OH)D) on CRC risk in three prospective cohorts: the Nurses' Health Study (NHS), the Health 
Professionals Follow-up Study (HPFS), and the Physicians' Health Study (PHS). We included 1895 CRC cases and 2806 
controls with genomic DNA. We calculated odds ratios and 95% confidence intervals for CRC associated with additive 
genetic risk scores (GRSs) comprised of all CRC SNPs and subsets of these SNPs based on proximity to regions of increased 
vitamin D receptor binding to vitamin D response elements (VDREs), based on published ChiP-seq data. Among a subset of 
subjects with additional prediagnostic 25(OH)D we tested multiplicative interactions between plasma 25(OH)D and GRS's. 
We used fixed effects models to meta-analyze the three cohorts. 

Results: The per allele multivariate OR was 1.12 (95% CI, 1.06-1.19) for GRS-proximalVDRE; and 1.10 (95% CI, 1.06-1.14) for 
GRS-nonproxVDRE. The lowest quartile of plasma 25(OH)D compared with the highest, had a multivariate OR of 0.63 (95% 
CI, 0.48-0.82) for CRC. We did not observe any significant interactions between any GRSs and plasma 25(OH)D. 

Conclusions: We did not observe evidence for the modification of genetic susceptibility for CRC according to vitamin D 
status, or evidence that the effect of common CRC risk alleles differed according to their proximity to putative VDR binding 
sites. 
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Introduction 

Both inherited and modifiable risk factors have been identified 
for colorectal cancer (CRC). To date, genome wide association 
studies (GWAS) have identified 32 SNPs at 23 independent loci 
associated with CRC [1-16]. A substantial body of evidence also 
demonstrates an inverse association between vitamin D status and 
CRC [17-24]. Nonetheless, data examining the potential interac- 
tion between genetic susceptibility to CRC and an environmental 
factor such as vitamin D status are lacking. 



There are plausible mechanisms by which genetic susceptibility 
to CRC may vary according to vitamin D status. First, one of the 
primary modes of action of vitamin D is via influence of gene 
transcription by binding of the active form 1,25-dihydroxy-vitamin 
D (l,25(OH) 2 D 3 ) to the nuclear vitamin D receptor (VDR) [25]. A 
prior study utilizing chromatin immunoprecipitation with mas- 
sively parallel sequencing (ChiP-seq) to identify vitamin D 
receptor (VDR) protein- DNA binding interactions observed that 
VDR binding sites were significantly enriched near autoimmune 
and cancer associated genes [26], including 3 previously identified 
CRC-associated SNPs. Second, although our understanding of the 
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functional implications of many CRC-associated SNPs is limited, it 
is possible that some of these loci may be associated mechanis- 
tically with pathways also influenced by vitamin D. 

Thus, we examined the joint effects of genetic markers of 
CRC previously identified by GWAS and plasma 25(OH)D on 
CRC risk in three prospective cohorts: the Nurses' Health Study 
(NHS), the Health Professionals Follow-up Study (HPFS), and 
the Physicians' Health Study (PHS). We also specifically 
explored the possibility that vitamin D status may differentially 
influence risk of CRC according to genetic variants proximal to 
regions of VDR binding demonstrated in functional ChlP-seq 
studies. 

Methods 

Study Population 

Our study included three case-control studies of CRC nested 
within the Nurses' Health Study (NHS), the Health Professionals 
Follow-up Study (HPFS) and the Physicians' Health Study 
(PHS). The NHS was established in 1976 when 121,700 US 
female registered nurses aged 30 to 55 years returned mailed 
questionnaires on risk factors for cancer and cardiovascular 
disease [27,28]. The HPFS was established in 1986 when 51,529 
male health professionals aged 40 to 75 years responded to a 
similar questionnaire [29]. In both cohorts, participants have 
returned questionnaires every 2 years to update information 
with response rates exceeding 90% [27,28]. In 1989-90, 32,826 
NHS participants and in 1993-95, 18,018 HPFS participants 
returned a blood specimen on ice packs. In 2001-04, 29,684 
women in NHS and 13,956 men in HPFS who had not 
previously provided a blood specimen mailed in a 'swish-and- 
spit' sample of buccal cells. On receipt, blood and buccal cells 
were centrifuged, aliquoted, and stored at — 70°C [30]. The 
Physicians' Health Study was a randomized, double-blind, 
placebo-controlled trial of aspirin and beta-carotene for the 
primary prevention of cancer and cardiovascular disease among 
22,071 U.S. male physicians ages 40 to 84 years enrolled in 
1982 [31]. Participants with a prior diagnosis of heart disease, 
cancer (except nonmelanoma skin cancer), renal or liver disease, 
peptic ulcer, or gout or used vitamin A or beta-carotene 
supplements, were excluded. Between 1982 and 1984, 14,916 
men (more than 70% of participants) returned blood samples by 
mail which were divided into aliquots and stored at — 82°C 
(later, at — 140°C) [32]. This study was approved by the Human 
Subjects Committee at Brigham and Women's Hospital and the 
Harvard School of Public Health in Boston, MA, USA. All 
participants provided informed consent. 

In all three cohorts, incident cases of CRC were identified 
by follow-up questionnaires and confirmed by medical records 
review or through mortality follow-up. In each cohort, up to 3 
controls were randomly selected from those who were alive and 
free of cancer at the time of case ascertainment. In NHS and 
HPFS, controls were matched to each case on ethnicity, year of 
birth and month/year of blood or buccal sampling [30]; in 
PHS, controls were additionally matched on smoking status 
[32]. 

Laboratory Assessment 

We previously measured plasma levels of 25(OH)D through a 
radioimmunosorbent assay in the laboratory of Dr. Bruce W. 
Hollis (Medical University of South Carolina, Charleston, SC). 
The median intra-assay coefficient of variation from blinded 
quality-control samples was 1 1.8% in NHS, 10.1% in HPFS, and 
13.8% in PHS. Cases and their controls were analyzed in the same 



batch, and laboratory personnel were blinded to case, control, and 
quality-control status [23,32,33]. 

Genotyping 

Genomic DNA was extracted from blood samples (HPFS, NHS, 
PHS) and buccal cells (NHS, HPFS) using conventional methods. 
We used the TaqMan Open Array SNP genotyping platform 
(Biotrove, Woburn, MA) with 384-well format TaqMan assays to 
genotype the following CRC-associated variants identified from 
previous GWAS: rs6691170, rs6687758, rsl0936599, rsl6892766, 
rs6983267, rsl0795668, rs3802842, rsl0505477, rs7014346, 
rs7136702, rsl 1169552, rs4444235, rs4779584, rs9929218, 
rs4939827, rsl0411210, rs961253, rs4925386 [4,8-10,12,15,16]. 
TaqMan primers and probes were designed using Primer Express 
Oligo Design software v2.0 (ABI PRISM). Primers, probes, and 
conditions for genotyping assays are available upon request. We 
genotyped rs2 151512 on Taqman as a surrogate for rs4925386 
(linkage disequilibrium r 2 1.0 in the HapMap CEU population) 
since genotyping by Taqman for rs4925386 on chromosome 
20ql 3.33 was not successful.A subset of participants (954 cases and 
1328 controls) had blood genomic DNA successfully genotyped 
using Illumina HumanOmniExpress. Missing SNP data was 
imputed to HapMap II release 22 using MACH [34]. All 
genotyping underwent standard quality control including concor- 
dance checks for blinded and unblinded duplicates and examina- 
tion of sample and SNP call rates. The call rate was >97% for all 
samples and >98% for all SNPs. 

Statistical Analysis 

We included a total of 1895 CRC cases and 2806 controls with 
genotype information assembled from NHS, HPFS, and PHS for 
our GRS. Within each cohort, we calculated allelic odds ratios and 
95% confidence intervals for CRC associated with each SNP and 
for genetic risk scores (GRSs). The GRS is an allelic scoring system 
incorporating each of the specified risk alleles associated with CRC 
based on prior GWAS to assign a single quantitative index of 
genetic risk to each subject. Our GRS assumes an additive allelic 
effect with carriage of an increasing number of copies of each risk 
variant. 

We constructed a GRS comprised of 18 Taqman CRC 
susceptibility SNPs (GRS- 18). We also examined GRSs comprised 
of a subset of these SNPs based upon the 10 SNPs examined in. a 
ChiP-seq analysis (GRS- 10) conducted by Ramagopalan et al. In 
this analysis, 3 CRC-associated SNPs were proximal (within 
150 kb on either side of main disease-associated SNP) to increased 
VDR binding (vitamin D response elements (VDRE)) (GRS- 
proximalVDRE), and 7 SNPs were not proximal to areas of 
increased VDR binding (GRS-nonproxVDRE) [26]. Among the 
2,282 subset of individuals with additional GWAS data, we 
created a GRS comprised of 31 CRC associated SNPs from 
GWAS (GRS-31). We also calculated GRSs using the effect 
estimates and standard errors for these same SNPs from large scale 
GWAS [35] and tested for differences in the GRS-proximalVDRE 
and GRS-nonproxVDRE. 

For analyses of the joint effect of plasma 25(OH)D and GRS- 18, 
we included the 881 cases and 1566 controls in NHS, HPFS, and 
PHS who also had previously measured 25(OH)D prior to CRC 
diagnosis [23,32]. This was repeated for the joint effects of GRS- 
31 and pre-CRC 25(OH)D levels among the 672 cases and 909 
controls with both GWAS data and vitamin D levels. We 
calculated odds ratios and 95% confidence intervals for CRC 
associated with each 1 ng/mL increase in 25(OH)D; high versus 
low vitamin D according to a threshold level of 25(OH)D 
associated with lower CRC risk (a 32 ng/mL); and quartiles of 
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25(OH)D [32]. We tested for multiplicative interactions between 
GRSs and vitamin D using a product term in the model and 
assessing its significance by the Wald method. 

We adjusted our genetic analyses for age at sample collection, 
race, cohort, and type of sample (blood or cheek). Analyses which 
also incorporated plasma 25(OH)D additionally adjusted for 
season of blood draw and 25(OH)D analysis batch. We also used 
multivariate models which included additional CRC risk factors, 
including regular aspirin use (yes or no), body mass index (BMI; in 
textiles), physical activity (in tertiles), history of CRC in a parent or 
sibling (yes or no), smoking status (never, former or current 
smoker), alcohol consumption (0-4.9, 5-9.9, 10-14.9 or a 15.0 g 
per day), and consumption of beef, pork or lamb as a main dish (0- 
3 times per month, once a week, 2-4 times per week or &5 times 
per week). In NHS and HPFS we also included regular non- 
steroidal anti-inflammatory drugs (NSAIDs) use (yes or no) and 
energy-adjusted calcium and folate intake (in tertiles). Study 
specific estimates were meta-analyzed to determine a combined 
OR and 95% CI using inverse variance weights [36]. Fixed effect 
model estimates were used as all tests of heterogeneity were non- 
significant (p>0.05). 

Results 

Our analysis included 1895 CRC cases and 2806 controls 
assembled from the three cohorts. Table 1 summarizes the 
baseline characteristics of each cohort. The direction and 
magnitude of our individual SNP effect estimates were comparable 
to those of the original reports as well as those observed in the 
Genetics and Epidemiology of Colorectal Cancer Consortium 
(GECCO) and Colon Cancer Family Registry (CCFR). GECCO/ 
CCFR encompasses 1 3 studies, including NHS, HPFS, and PHS 
with genetic information on a total of 10,061 cases and 12,768 
controls (Table 2). 

The additive GRSs comprised of the CRC risk alleles yielded 
effect estimates for CRC within our three cohorts that were 
comparable with those derived from all 13 cohorts encompassed 
by GECCO and CCFR (Table 3). We also compared the 
estimates obtained from models with the GRS-proximalVDRE 
and GRS-nonproximalVDRE scores alone with models contain- 
ing both GRSs (conditional models) and did not observe a material 
difference. There was no significant difference between a GRS- 
VDR (multivariate OR, 1.12; 95% CI 1.06, 1.19), and a GRS- 
nonproximalVDRE (multivariate OR, 1.10; 95% CI 1.06, 1.14) 
(p-heterogeneity = 0.52). 

Consistent with our prior reports [23,32], we observed an 
inverse association between 25(OH)D and CRC in a meta- 
analysis of results from the three cohorts (Table 4). Compared 
with individuals in the lowest quartile of 25(OH)D, men and 
women in the highest quartile had a multivariate OR for CRC 
of 0.63 (95% CI 0.48-0.82; p-trend<0.001), and for those with 
higher levels of 25(OH)D (> 32 ng/mL) compared with lower 
levels, the multivariate OR was 0.79 (95% CI 0.65-0.98; 
p = 0.03). 

Tests of multiplicative interactions between continuous GRSs 
and continuous 25(OH)D did not yield statistically significant 
results, nor did tests of multiplicative interactions between 
GRSs and vitamin D categorized by a threshold level of 
32 ng/mL. Likewise, the risk estimates associated with GRS31 
did not vary according to quartile of vitamin D (phet = 0.98) 
(Figure 1). 
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Table 3. Comparison of GRS-31, GRS-18, GRS-10, GRS-proximalVDR and GRS-nonproxVDR. 









GRSproximalVDR" 


GRSnonproxVDR 1 ' 


GRS-10' OR 


GRS-18 8 OR 


GRS-31 E OR 




n (cases/controls) 


OR (95% CI) 


OR (95% CI) 


(95% CI) 


(95% CI) 


(95% CI) 


NHS 


2358 (922/1436) 


1.10 (1.01, 1.20) 


1.13 (1.07, 1.19) 


1.12 (1.07, 1.17) 


1.07 (1.04, 1.10) 


1.05 (1.02, 1.09) 


HPFS 


1457 (586/871) 


1.13 (1.01, 1.26) 


1.08 (1.01, 1.15) 


1.09(1.03, 1.16) 


1.05 (1.01-1.09) 


1.04 (0.98, 1.10) 


PHS 


886 (387/499) 


1.15 (1.01, 1.31) 


1.07 (0.98, 1.17) 


1.09 (1.01, 1.18) 


1.07 (1.02-1.12) 


1.05 (1.00, 1.10) 


Meta analysis 


4701 (1895/2806) 


1.12 (1.06, 1.19) 


1.10 (1.06, 1.14) 


1.10 (1.07, 1.14) 


1.06 (1.04-1.09) 


1.05 (1.02, 1.08) 


GECCO 


22829 (10061/12768) 


1.11 (1.09, 1.14) 


1.09 (1.07, 1.11) 


1.10 (1.08, 1.11) 


1.08 (1.07, 1.09) 


1.08 (1.07, 1.09) 



otGRS-proximalVDR: additive genetic risk score used to estimate a per allele OR and 95% CI. Includes SNPs; rs3802842, rs4939827, rs1 041 1210. 

PGRS-nonproxVDR: additive genetic risk score used to estimate a per allele OR and 95% CI. Includes SNPs; rs6983267, rs1 6892766, rsl 0795668, rs4444235, rs4779584, 

rs9929218, rs961253. 

xGRS-10: additive genetic risk score used to estimate a per allele OR and 95% CI. Includes SNPs; rs3802842, rs4939827, rs10411210, rs6983267, rs16892766, rs1 0795668, 
rs4444235, rs4779584, rs9929218, rs961253. 

8GRS-18: additive genetic risk score used to estimate a per allele OR and 95% CI. Includes SNPs; rs3802842, rs4939827, rs1041 1210, rs6983267, rs16892766, rs10795668, 
rs4444235, rs4779584, rs9929218, rs961253, rs6691170, rs6687758, rs10936599, rs10505477, rs7014346, rs7136702, rsl 1169552, rs2151512. 

eGRS-31: additive genetic risk score used to estimate a per allele OR and 95% CI based upon the subset of cases and controls with GWAS data (394 cases/772 controls in 
NHS; 228 cases/222 controls in HPFS; 332 cases/332 controls in PHS). Includes SNPs; rs3802842, rs4939827, rs10411210, rs6983267, rs16892766, rs10795668, rs4444235, 
rs4779584, rs9929218, rs961253, rs6691170, rs6687758, rs10936599, rs7136702, rsl 1 169552, rs4925386, rs1091 1251, rs11903757, rs647161, rsl 321 311, rs719725, 
rs3824999, rs3217810, rs3217901, rsl0774214, rs59336, rs1957636, rsl 1632715, rs16969681, rs2423279, rs4813802. 
doi:1 0.1 371 /journal.pone.009221 2.t003 



Discussion 

There is substantial evidence supporting an inverse association 
between circulating 25(OH)D and CRC risk; meta-analyses and 
systematic reviews have observed a 50% lower risk of CRC 
comparing extreme quintiles of 25(OH)D [21,24]. Several 
mechanisms have been hypothesized to underlie this association, 
some of which may be shared by pathways associated with the 
putative functional consequences of CRC susceptibility SNPs 
proximal to VDR-DNA binding sites. In addition, vitamin D 
signaling occurs through binding of the active form l,25(OH)D to 
nuclear vitamin D receptor (VDR) along specific genomic 
sequences known as vitamin D response elements (VDREs),) 
which act to activate or repress gene transcription [25]. In our 
study of 1895 CRC cases and 2806 controls nested within three 
prospective cohorts, we did not observe statistically significant 
evidence for interaction between CRC genetic markers, including 
the 3 CRC SNPs identified in a prior CHiP-seq analysis as being 
adjacent to VDR-DNA binding sites, and plasma 25(OH)D on 
CRC risk, despite observing an increased risk for CRC associated 
with GRSs and an inverse association between circulating 
25(OH)D levels and CRC. [26]. 



The lack of a significant interaction between 25(OH)D and 
genetic susceptibility to CRC may have several explanations. First, 
pathways associated with CRC susceptibility loci may in fact be 
distinct or overlap minimally with mechanisms associated with 
vitamin D. Second, our GRS and single measures of plasma 
25(OH)D measures may be relatively insensitive or incomplete 
markers of relevant biological pathways shared by genetic 
susceptibility and vitamin D. Third, we constructed GRSs 
informed by the results of a functional study by Ramagopalan et 
al. that applied ChiP-seq in lymphoblastoid cells treated with 
calcitriol for 36 hours to demonstrate subsequent differential 
association of vitamin D receptor binding with three specific CRC 
susceptibility loci [26] . However, a separate study utilizing ChlP- 
Seq in monocytes treated with calcitriol for 40 minutes found only 
18% of calcitriol-stimulated VDR-binding sites common to those 
observed in Ramagopalan et al. [37,38]. These results suggest that 
VDR target gene regulation may differ according to cell type and/ 
or duration of vitamin D exposure which may be difficult to assess 
using circulating measures of 25(OH)D. 

Our study has several strengths. First, the availability of both 
genetic information and prediagnostic measures of plasma 
25(OH)D in our three cohorts permitted, to our knowledge, the 



Table 4. Risk of colorectal cancer associated with circulating 25(OH)D within NHS, HPFS and PHS. 







NHS 


HPFS 


PHS 


Meta-analysis 


N (case/control) 


352/665 


277/535 


252/366 


881/1566 


Continuous 25(OH)D 


0.98 (0.97, 1 .00) 


0.99 (0.97, 1.01) 


1.01 (0.99, 1.03) 


0.99 (0.98, 1.00) 


High vitamin D (> 32 ng/ml) 


0.76 (0.54, 1 .08) 


0.77 (0.56, 1 .05) 


0.91 (0.59, 1.41) 


0.79 (0.65, 0.98) 


Quartile 1 


1.00 (referent) 


1 .00 (referent) 


1.00 (referent) 


1.00 (referent) 


Quartile 2 


0.77 (0.53, 1.12) 


0.90 (0.60, 1 .35) 


0.70 (0.41, 1.19) 


0.80 (0.62, 1.02) 


Quartile 3 


0.58 (0.39, 0.87) 


0.64 (0.41, 0.99) 


0.91 (0.54, 1.52) 


0.67 (0.52, 0.86) 


Quartile 4 


0.57 (0.38, 0.87) 


0.67 (0.42, 1 .05) 


0.67 (0.39, 1.17) 


0.63 (0.48, 0.82) 
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Figure 1. GRS-31 and CRC stratified by quartiles of 25(OH)D. 

Demonstrated no evidence for variation in CRC risk estimates 
associated with GRS-31, stratified across quartiles of 25(OH)D 
(phet = 0.98). 

doi:1 0.1 371 /journal.pone.009221 2.g001 

first analysis of the effect of CRC susceptibility loci according to an 
integrated biomarker of an environmental determinant of CRC 



risk. Second, our reasonably large sample size provided individual 
SNP and GRS associations that were similar in the direction and 
magnitude with estimates from larger cohorts, including the 
GECCO and CCFR consortium. We acknowledge limitations of 
our study, including a single measurement of 25(OH)D which may 
not reflect long-term vitamin D status or the tissue-specific effects 
of vitamin D. We also had a more limited sample size of 
participants with both genetic information and measured levels of 
plasma 25(OH)D. 

In summary, in this large study of CRC cases and controls 
characterized for genetic susceptibility to CRC with prediagnostic 
measurements of 25(OH)D levels, we did not observe evidence for 
the modification of genetic susceptibility for CRC according to 
vitamin D status. 
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